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Abstract 


An  ill-posed  linear  problem  Ax  =  y  in  Hilbert  space  is  considered  as  a 
filtering  problem  AX  +  Z  =  Y  for  Hilbert  space  valued  random  elements. 
Depending  on  the  models  for  the  signal  X  and  the  noise  Z,  the  solutions  of 
this  problem  are  discussed  in  the  context  of  cylinder  measures  on  Hilbert 
spaces  and  their  radonification  by  the  Abstract  Wiener  space  concept. 
Extensions  of  the  solutions  to  measurable  transformations  are  given 
explicitly.  The  filtering  solution  is  related  to  the  solution  of  the  problem 
Ax=y  obtained  by  Tichonov's  regularization  method. 


This  research  has  been  supported  by  AFOSR  Contract  No.  F  49620  82  C  0009 


fUVVV.V.V.A.V,  ^.V.  *„•  •  •  v. 


s mm 


1 


1.  Introduction 

Let  Hj  and  H.,  be  real,  separable  Hilbert  spaces  and  A:H, — »H.,  be  a  linear  bounded 
operator.  By  Hadamard's  definition,  a  linear  problem  Ax  =  y  is  well-posed  if  the  solution 
exists,  is  unique  and  depends  continuously  on  the  data;  otherwise  a  problem  is  called  ill 
posed. 

Examples:  1)  Fredholm  integral  equations  of  the  first  kind; 

If  QcRn  is  a  bounded  region,  kt  L-  ( Q2),  f€L2  (Q),  then 

Kftx):=  j  ktx,y)fTy)dy 
1  o 

is  Hilbert-Schmidt  in  L2(Q). 

Hence  the  linear  problem  Kf=  g  (gcL2  (Q)  given)  is  ill-posed. 

2)  A  linear  equation  Ax  =  y  in  Rn  may  be  numerically  ill-posed,  if 
det  A  <  <  1 

In  the  following  we  will  consider  such  problems  from  a  statistical  point  of  view  as 
introduced  in  the  work  of  O  N.  Strand  and  E.R.  Westwater  [11],  J.N.  Franklin  [2]  and  A. 
Uhlig  [13].  This  point  of  view  is  motivated  by  the  following  reasons: 

Many  methods  for  the  calculation  of  unknown  states  x  in  physical  or  technical  problems 
do  not  allow  to  observe  the  interesting  state  x  directly,  but  give  an  observation  y,  whose 
functional  relationship  with  x  may  be  described  by  a  linear  equation  Ax  =  y  as  above. 
Such  observations  often  may  be  affected  with  a  random  additive  noise.  Also  the 
unknown  state  x  may  depend  itself  on  a  random  law.  Thus  a  linear  inverse  problem 
Ax  =  y  often  is  already  an  approximation  for  an  equation  of  the  form  Ax  +  z  =  y,  where 
x,y,z  are  random  elements,  i.e.  we  have  a  filtering  problem,  namely  to  estimate  the 
unknown  state  x  given  a  noisy  observation  y. 

Of  course,  one  will  need  additional  information  to  solve  this  estimation  problem, 
assumptions  on  the  laws  of  the  random  elements  and  a  loss  function  to  optimize  the 
estimate.  However,  the  deterministic  methods  to  give  an  approximate  solution  for  an  ill- 
posed  problem  Ax  =  y  in  Hilbert  space,  for  instance  the  Tichonov  regularization  (  cf. 
[12]),  also  need  additional  information.  The  Tichonov  regularization,  i.e.  the  solution  of 
the  variational  problem  IIAx-yll2  +  aF(x)  =  min!,  requires  information  on  the  smoothness 
of  x  to  choose  the  regularization  parameter  a  and  the  regularization  functional  F. 
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To  give  estimates  of  the  approximation  error,  this  method  also  needs  to  know  a  compact 
set  containing  the  solution  x. 

Considering  the  problem  as  a  stochastic  filtering  problem,  those  assumptions  are 
replaced  by  statistical  assumptions  on  the  signal  and  the  noise. 

Let  (QJ^J3)  be  a  probability  space,  and  H,  real,  separable  Hilbert  spaces,  X  an 
H^valued  random  variable  on  (Q  .FJ3),  and  Y,Z  H.,-valued  random  variables  resp.  .  For 
a  linear  bounded  operator  ArHj — »H.,  we  consider  the  estimation  problem  AX+Z  =  Y, 
i.e.  we  want  to  give  an  estimate  x*  for  X(co)  given  Y(u>)  =  y. 

To  make  the  paper  self-contained,  in  the  following  we  shortly  summarize  the  well- 
known  solution  for  finite-dimensional  state  spaces  H,  ,H.,.  In  section  2  we  will  consider 
the  problem  for  infinite-dimensional  state  spaces. 

Finite-dimensional  state  spaces  H;  ,H, 


Notations:  i)  E(<X,h>)=  <x',h>(h£H1)forasuitablex'cH1; 

x'  is  called  the  mean  E(X)  of  X. 

ii)  E(<X-x',hl>  <X-x',h2>)  =  <Rh,,h2>  (h1,h2€H1)  with 

R  self-adjoint,  RsO,  and  for  X  centered,  (ek)  CONS  in  H,: 

di  mH 

E(11X1I2)=  V  <  Re  e,  >  =  Tr  R. 

k  =  1 

R  is  called  covariance  of  X. 


Assumptions: 


i)  X  and  Z  independent 


ii)  X  centered  with  covariance  R 


iii)  Z  centered  with  covariance  S 
We  look  for  a  linear  least  squares  estimate  L0:H2 — »H,,  i.e. 


(l)E(IIX-LoYII2)  =  min  {E(1IX-LYII2):L:H2—.H1  linear,  bounded} 


Let  Q:H2 — *Hj  be  the  cross-correlation  between  signal  X  and  observation  Y, 
i.e.  <hj,Qh2>  =  E(  <X,h1  >  <Y,h2>),(h,€H1,  h2£H2). 


Calculating  the  error  covariance: 

(2)  E(IIX-LYII-)  =  Tr.  [R  +  LKL*-LQ*-QL*]  =:F(L) 

where  K  is  the  covariance  of  Y  and  L*  denotes  the  adjoint  operator  of  L,  we  see  that  F  is 
a  quadratic  form  on  the  Hilbert  space  L(H„H,)  of  linear  operators  from  H  ,  into  H, 
endowed  with  the  Hilbert-Schmidt  norm 

dimH.-, 

11  L  ll2 :=  Y"  <  Lf  ,  f.  >  (if  I  CONS  in  H  i 

—  K  K  k  2 

k  =  i 

Choosing  Ln  such  that  L0K  =  Q,  one  obtains  F(L)>F(Ln)  =  Tr.  [R-L0KL0*] 

(L€L(H.,,Hj)). 

Thus  L(  solves  the  estimation  problem. If  X.Z  are  Gaussian,  one  explicitly  has  Q  =  RA*, 
K  =  ARA*  +  S.  and  therefore  L0  =  RA*(ARA*  +  S) '  ;  here  K  1  denotes  the  inverse  of  K  if  K 
is  invertible,  otherwise  K  1  means  the  left-pseudo-inverse  (K*K)  'K*  of  K. 

For  a  given  right  side  y  =  Y(co)  one  thus  has  as  LLS-estimate  (linear  least  squares 
estimate) 

(3)  x*  =  RA*(ARA*  +  S)  *y  =  L„y 
Remarks 

i)  If  X.Z  are  not  centered,  x*  =  E(X)  +  L0(y-A(E(  X)-E(Z))  is  LLS-estimate 

ii)  If  X.Z  are  independent  Gaussian  with  regular  covariances,  then 
LQY  =  E(XIY),  the  conditional  expectation  of  X  given  Y, 

and  x*  is  the  mean  of  the  conditional  distribution  of  X  given  Ytu>)  =  y 

iii)  L0  minimizes  the  error  of  linear  functionals  of  the  estimate,  i.e. 

(4)  E(<X-L0Y,h>2)  =  min{E(<X-LY,h>2):L€L(H>,H,)}  (heH,) 

iv)  If  A  is  invertible  and  S  =  0  (no  noise),  then  x*  =  A  'y,  the  deterministic  solution  of 
the  inverse  problem,  for  arbitrary  covariance  R. 

v)  A  connection  with  Tichonov's  regularization  method  is  given  by  the  following 
observation: 

The  functional  IIAx-yll2 +  o  <  R 'x.x  >  attains  its  minimum  at 
x*  =  RA*(ARA*  +  o2I)‘y  (cf.[12]h 
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Thus  Tichonov's  method  with  regularization  parameter  o  and  regularization  functional 
<R1.,.>  gives  the  LLS-estimate  under  the  assumptions  of  a  centered  signal  X  with 
regular  covariance  R  and  a  centered  white  noise  Z  with  covariance  o-’I  ,  I  identity 
operator  on  H 

2.  LLS-estimates  in  infinite-dimensional  state  spaces. 


V 
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In  the  following  let  and  H.,  be  real,  separable,  infinite-dimensional  Hilbert  spaces.  Of 
course,  the  problem  in  generalizing  the  results  to  infinite-dimensional  spaces  depends 
on  the  choice  of  the  mathematical  model  for  the  signal  X  and  the  noise  Z.  The  following 
considerations  may  clear  the  fundamentals  and  show  the  significance  of  the 
development  of  a  finitely  additive  filtering  theory  in  the  work  of  A.V.  Balakrishnan  [  1  ] 
and  in  a  series  of  papers  of  G.  Kallianpur  and  R.  Karandikar  (cf.  [8]). 

2,1  Gaussian  signal  and  noise  with  nuclear  covariances. 

For  convenience  ,  let  A:Ht — >H2  be  onto,  but  not  continuously  invertible.  Let  X,  Z  be 
zero-mean  Gaussian,  independent,  with  nuclear  covariances  R  and  S  resp..  and 
Kerf  ARA*  +  S)  =  {0}.  By  £(H  ,)  we  denote  the  Borel  measurable  subsets  of  H 
The  problem  is  again  to  give  an  estimate  x*  given  Y(eo)  =y,  where  AX  +  Z  =  Y. 

By  assumption,  the  operator  (ARA*  +  S)  has  dense  range  in  H,;  hence  it  exists  a  unique 
left-inverse  (ARA*  +  S) 1  with  domain  of  definition  rg(ARA*  +  S);  but  it  is  unbounded  as 
inverse  of  a  nuclear  operator.  Moreover,  the  well-known  crucial  point  is.  that 
rg(ARA*  +  S)  is  a  set  of  measure  zero  with  respect  to  the  distribution  of  Y: 

(5)  P°Y  1  (rg(ARA*  +  S))  =  0 


v 
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Proof.  Denote  ARA*  +S  =:  K;  then  rg(K)crg(K*).  If  (ek)  is  a  CONS  of  eigenvectors  of  K 
with  corresponding  eigenvalues  (,\k),  then  xf  rg< K - )  i ff  ( A ^  *<x.ek  >  HL(N). 

But  Ak  *<.,ek  >  is  standard  Gaussian  on  (H7,  £(H,),PoY  'khence 
x  <  ,e,  >‘ 

1  I  i 

'  -  is  PoY  “  —  a  e  divergent,  i  e  PoY  ~  ‘  <rg» K)i=  0 

k  =  1  ^k 


# 


-r  -  ■* 
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Thus,  in  this  case,  the  estimate  x*  =  L0y,  L0  —  RA*(ARA*  +  S) as  given  in  [2]  ,  is 
useless,  since  it  works  only  for  observations  y,  which  P°Y  1  -  a.  s.  do  never  appear. 

In  the  following,  it  is  shown  that  there  exists  an  extension  of  L0  to  an  operator  L  whose 
domain  of  definition  has  P°Y  1  -  measure  one,  i.e.  L  is  a  measurable  transformation, 
with  respect  to  P°Y 

Proposition  1,  Let  (ek),x  be  a  CONS  of  eigenvectors  of  K  with  corresponding 

eigenvalues  (.\k),N 

Let  (rk),*  be  a  CONS  of  eigenvectors  of  R  with  corresponding 
eigenvalues(pk),*. 

Then  it  holds  for  every  sf  rg(K): 

1  1  <s,  e  > 

1 6 1  L  s  =  ^  p*  ^  -  < 

o  —  0  —  \ 

j  =  l  i  =  i  i 

Defining  L  by  (6)  for  all  y(H2  satisfying 

*  1  <y,e  >  | 

\  N  - i —  <R*A*e,r>  "<•■*, 

—  J  —  X  1  J 

j=i  i=l  i 

one  obtains: 

i)  L  extends  L0 

ii)  L  is  measurable  and  P°Y  1  (D(L))  =  1, 
where  D(L)  is  the  domain  of  definition  of  L. 

iii)  x*:  =  Ly  is  the  mean  of  the  conditional  distribution  of  X  given 
y€D(L) 

Proof.  Clearly,  L  extends  L0,  the  domain  D(L)  of  L  contains  D(L0),  and  L  is  £(H2)- 
£(Hj)  -  measurable  as  a  limit  of  finite  sums  of  measurable  mappings.  One  has 
to  show  PoY  *(D(L))  =  1 

To  prove  this,  the  following  facts  will  be  used  (cf.  Gihman-Skorohod  [3]): 


R’A*e  ,  r  >  r 
‘  J  j 
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a)  A  well-known  theorem  of  Kolmogorov  states:  if  are  independent.  Si- 
valued  r.v.  such  thatE£k  =  0  (k€N)  and 

x  x 

^  E  1  Cj.  12<  x  .  then  ^  <  *  a  e 

k=i  k=i 

b)  If  (yj]1  are  R-valued  such  that 

x  x 

^  E  I  4k  1  <  x,  then  !  ck  1  <*  a  e 
k  =  1  k=l 

c)  With  the  above  notions 

*  <  Y, e  > 

Y  =  ^  \ 5  <  Y ,  e  > e  and  Y  =  - - — 

~  l  i  1  1  »  i 

1=1  V 

are  standard  Gaussian  (i€  N) 

d)  (ARA'  +  SHe^A  ie,  (i€N) 

e)  By  d)  Ls  is  transformed  to 

1  1  <s.  e  > 

Ls  =  ^  p  *  '''  <Ce  ,r  >  - - —  r 

—  rj  —  i  1  j  j 

,=1  ,=1  \ 

where  C:  =  R’A*(  ARA*  +  S)*;  consequently 

r  x 

LY='S‘  p  !  ^  <  Ce  ,  r  >  Y  r 

—  J  —  1  j  J 

j=i  i=l 

f)  For  C  in  e)  it  holds  IICII  <  1,  because: 

v€D(C)  =  rg(ARA*  +  S)*  =>  v  =  (ARA*  +  S)*u  for  suitable  u(H„ 
and  since  S>0 

IICvll2  =  llR*A*ull <  < ( ARA*  +  S)u,u >  =  IKARA*  +  S)4ull-  =  llvll2. 

Thus,  C  has  a  continuous  extension  to  all  of  H,,  say  Ce,  since  D(C)  is  dense  in 
H2.  Hence,  C*  =  Ce*  exists  and  IIC*II  <  1;  also  C**  =  Ce  exists  and  IIC**I!  <  1. 
It  holds  C*  =  (ARA*  +  S)  *AR* 


t 


Now,  set 


c  :  =  N  <Ce,r>^ 

1  —  i  J  j 

i  =  l 

Then  for  all  je  !«',  Cj  <  *  P  -  a.  e..  i.e. 


J1 


PoY ~  1  (  ?■  N  <Ct  .  r>  \  "5<s. 


e  >  -w  ,  =  i 


Namely, 

EY=0,EY2  =  1  (i€N)  and  thus 

x  x 

e(  <Ce  ,r  > Y  f=  N  <e  ,C*r  >“’ 

1=1  1=1 

and  a)  proves  g). 


Furthermore,  for  all  je  N  :  EC2  £  1.  Namely,  define 


_  V 


<Ce  .  r  >  Y 


and  observe  Y  independent  and 

n 

EC2  =  N"  <  Ce  ,  r  >2  s  1 

l.n  —  i  j 

i  =  1 

Thus,  by  g),  C,n  P  -  a.  e.  convergent  for  n — ♦=,  and  also  Cin2  is  a  P  -  a.  e.  convergent 
sequence  of  r.v.  such  that 

lim  c  =  C  • 


,lmnS.n  =  V 
Thus  by  Fatou's  lemma, 

x 

E  <^2  <  liminf  EC2  =  ^  <e  ,  C*r  >'  —  HC*r  11“  <  1 

j  n  },n  i  j  j 

l  =  1 

Eventually,  we  obtain 


11LY112=  V  p^2 

J=1  J 

converges  P  -  a.  e.,  since 

x  .  JO 

^  e|  p£2  s  V  P  =  Tr  R  <  00 

j=l  ]=1 

and  by  b), 

x 

^  p^2  <  «  P  —  a.  e. ,  i.e  PoY  “  1  (D(L))  =  1 
j=i  J 


^  S 


Now  the  estimate  is  set  x*:  =  Ly  for  y€D(L)).  To  show  assertion  iii)  of  the  proposition,  it 
is  remarked,  that  (X,  Z)  is  a  zero-mean  Gaussian  random  element  with  values  in  x  H 
and  covariance  operator 
j  R  O 
'  O  S  ^ 

Therefore,  by  the  independence  of  X  and  Z, 

/  I  O  /  X 

(X ,  Y )  =  1. 

'  A  I  l{  Z  1 

and  (X,Y)  has  covariance  operator 


R  RA 


AR  ARA 


/  I  O  /  R  O  /  I  A* 
*+S/~  A  I  O  S  O  I  ^ 


Consider  the  error  L,:  =  X  -  LY. 


It  is  not  immediate  that  C  is  Gaussian,  since  L  is  unbounded. 


Let<J)  .  Yl((h,,h2))  :=  E,.Ylexp[i  <«/¥).  (h,,  h,)>]  = 

/  I  -  L 

E,.v.V)exP  [<\  o  ,  )«*•>•). ‘h,.h?»  = 

E  x  yiexp[i<X,  h,  >  -  <LY,  h,  >  +  <Y.  h,>] 
be  the  characteristic  function  of  (£,Y). 

Let  L  denote  the  partial  sums  in  the  definition  of  L: 

L,j  :=  Ph,j  L0  Ph2‘  ,  where  H,J  =  sp{r, . r } ,  H =  sp  {e, . ej  ,  and 

?HkJ  the  orth.  projection  from  Hk  onto  HkJ  (k  =  1.  2  ;  i  j  i  N) . 

Then 

h)  L.(ARA*  +  S)  =  PHljRA*PH2‘ 

k)  L(JAR  =  PH,jR*CPh2'C*R}. 

Calculating  <J>1J(i  Yl,  denoting  analogously  the  char,  function  corresponding  to  L  ,  one  gets 

l) 

<J\<.Y>(hrh2)  =  ,im,im  ♦,'<  y,(hrh^ 

>  j 

by  continuity  of  the  norm  and  the  exponential  function  and  by  Lebesgue's  theorem. 
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I 

i 

I  o 

,  h. 

'  h  h.,i  =  E 

x.y,exP 

i<  lA',  Vi ,  ( 

-L*  I  1 

( h,  P  = 

U 

** 

R 

RA  * 

,  I  0 

(  h, 

I  O  ,  h. 

-  \  <( 

ii  .  i 

)• 

)  !> 

AR 

ARA*  —  S 

1  -L  l  1 

h, 

-L  I  '  h, 

e\P;-J  <(R-P  R!CP  C'R’ih  h  > 
[  I  h-  n,  11 


-RA*~P  RA*P  ih  h  >  -  <(ARA*  -  S  -h  , .  h 
H-  11',*  ‘ 


By  1)  one  gets 

nt  <J)  y  (hph,)  =  exp  {-^  [ < (R  -  R:C**C*Rr )  h, ,  hj  >  + 

( ARA*  +  S)  h_„  h„  >  ]}  = 

=  exp{-$-<D(h,,  h.,)  ,(h,,  h,)>}  where 

/  R  -  RiC**C*R‘  O 
D  =  (  I 

O  ARA* - S 

is  again  nuclear  and  non-negativ  in  H,  x  H 


Thus  (y,  Y)  is  Gaussian  in  H;  x  H„  zero-mean,  with  covariance  D. 

The  form  of  D  gives  independence  of  y  and  Y. 

Hence  (  is  zero-mean  Gaussian  with  covariance  R  -  R?C**C*R; 
Eventually,  X  =  y  +  LY  and  the  conditional  distribution  of  X  given  Y 
obtained  by 

P(X€B  I  Y  =  y)  =  P(y  +  LY  eB I Y  =  y)  =  PityB  -  LY  I  Y  =  y) 

=  P((eB  -  Ly)  =  P(y  +  Ly  €B)  (B  €  £(H, )) , 

i.e.  x*  =  Ly  is  the  mean  of  the  conditional  distribution  of  X  given 

Y  =  y,  y  €  D(L),  as  in  the  finite  dimensional  case. 


=  y,  y  € D( Lb  is 


/.v’v!  -r  iy !  v"y  ^!>)y 


mSm 


Remarks: 


i)  L  is  a  measurable  linear  estimator  which  dominates  the  class  of  continuous 
linear  estimators  T:H,  — *  Hj  without  necessarily  being  a  member  of  this 
class. 

ii)  Also  in  the  sense  of  (4),  L  gives  the  best  mean  square  estimate  for  linear 
functionals  <X,h  >  of  the  signal  X  (h€Ht  arbitrarily  fixed). 

iii)  If  X,Z  are  not  centered,  we  find  analogously  to  the  finite  dimensional  case 
x*=E(X)  +  L(y-A(E(X))-E(Z))  (y € D( L))  as 

LLS-estimate  for  x  with  respect  to  (1)  and  (4).  But  additionally  we  have  to 
require  AE(X)  +  E(ZKD(L)  to  insure  that  the  right  side  above  is  well  defined. 

iv)  If  the  correlation  K  of  Y  has  a  non-trivial  null-space,  then  one  considers 

H,  ©  Ker(K).  It  holds  P°Y  l(H,  ©  Ker(K))  =  1.  Thus,  we  have  a  unique  left- 
inverse  for  the  restriction  of  K  onto  H,,  and  obtain  a  measurable  linear 
extension  L  for  L0,  where  Ln  =  Q°(KIH,©  Ker(K))1,  by  restricting  the 
extension  procedure  in  Proposition  1  to  those  eigenvalues  of  K  which  are 
greater  than  zero.  Hence,  the  above  results  can  be  transferred  also  to  this 
case. 

v)  Let  signal  X  and  noise  Z  be  zero-mean  Gaussian  with  nuclear  covariances  R 
and  S  resp.,  but  correlated  with  cross-correlation 

Q  :  H2  — ►  H, .  If  a  joint  Gaussian  distribution  of(X,Z)  with  correlation 

R  Q  | 

Q*  S  / 

is  assumed,  then  L0  corresponds  to 
L,:  =  (RA*  +  QXARA*  +  AQ  +  Q*A*  +  S) 

Again  P°Y ‘(DfLj))  =  0.  Analogously  to  Prop.  1,  an  extension  of  the 
estimation  operator  L,  to  a  measurable  transformation  L  is  possible  under 
the  additional  assumption  that  rgQcrgRL 

Then  again  the  estimation  error  q=  X-LY  is  zero-mean  Gaussian,  and  Lis  the 
optimal  estimation  operator  with  respect  to  (1)  and  (4). 
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2.2  A  finitely  additive  cylinder  measure  as  noise  model 
2.2.1  Weak  random  variables 


We  make  the  following  assumptions: 

i'  The  signal  X  is  zero-mean  Gaussian  with  values  in  H,  and  nuclear 

covariance  R. 

ii)  The  noise  Z  is  a  zero-mean  Gaussian  weak  random  variable  with  bounded. 

strictly  positive  covariance  S:H,  — .  H,.  i.e.  a  zero-mean  Gaussian  cylinder 
measure  with  covariance  operator  S  is  associated  to  Z. 

ii i '  The  signal  and  the  noise  are  independent. 

We  look  again  for  an  estimate  of  x  in  the  problem  Ax-t-z  =  y,  where  A  is  onto,  but  not 
continuously  invertible,  and  Ker(  ARA*  +  S)  =  {0}  for  convenience. 

As  it  is  shown  in  [1],  the  following  result  holds: 

If  the  class  of  admissible  estimators  is  restricted  to  Hilbert-Schmidt 
operators  in  (1)  and  (4),  then  L0y:  =  RA*( ARA*  +  S)  'y  is  the  optimal  estimate 
for  x  in  the  problem  Ax  +  z  =  y  with  respect  to  the  modified  criteria  ( 1 )  and  ( 4 ), 
and  L0  is  Hilbert-Schmidt  itself. 


The  proof  is  the  same  as  in  section  1.,  if  the  restriction  to  Hilbert-Schmidt  operators  as 
admissible  estimators  is  made. 
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Remarks: 


.  V,  v.  „■ 


v  /'/  --  ^ 


Ljjy  cannot  directly  be  interpreted  as  mean  of  the  conditional  distribution  of 
X  given  y,  since  the  ’conditional  distribution’  of  X  given  an  observation, 
which  is  associated  to  a  cylinder  measure  on  H,,  does  not  exist  in  the 
countably  additive  sense. 

Only  for  linear  functionals  of  the  estimate  this  term  is  appropriate  in  the 
sense  of  modified  (4). 


The  restriction  to  zero-mean  random  elements  is  not  essential 
(cf.  2.1).  The  assumption  of  independence  of  signal  and  noise  may  be  replaced 
by  the  assumption  that  the  correlation  K  =  ARA*-t-S  of  the  observation  is 
strictly  positive. 

If  the  signal-covariance  R  is  allowed  to  be  non-nuclear,  then  the  error 
covariance  Tr.(R  4-  LKL*-LQ*-QL*)  is  not  defined 
(Q  cross-correlation  between  signal  and  observation). 

However,  if  K  is  continuously  invertible,  then  L0:  =  QK  1  is  the  best  estimator 
for  linear  functionals  in  the  sense  of  modified  (4). 

If  we  assume  the  signal  to  be  a  white  noise,  i.e.  a  zero-mean  Gaussian  weak 
random  variable  on  Ht  with  covariance  o2I,  and  the  noise  as  well  zero-mean 
white  noise  on  H,  with  covariance  o2I,  then  L0y  =  o02A*(o02AA*  +  o2I)  ’y  is  the 
solution  of  the  Euler  equation  ( A*A  +  aB)x  =  A*y  (y€  H2),  where 


a  =  —  and  B  =  I 


That  means,  that  the  solution  of  the  Tichonov  regularization 
IIAx-yll2  +  aF(x)  =  min!  for  the  ill-posed  linear  problem  Ax  =  y  is  given  by  our 
estimate  in  the  filtering  problem  Ax  +  z  =  y,  if  the  regularization 
parameter  a  and  the  regularization  functional  F  are  chosen  to  be 


"•I 


and  F(x)  =  llxll2  (cf.  [12]). 


v*  v.ViV.x  r 


OPC. ■r-  *r.  <r. 


«  *  .  "  .  •  .  ’  .  ^  .  •*  -  *  a*  « 


2.2,2  Radonification  of  the  cylindrical  model  by  the  concept  of  Abstract  Wiener  Space 

In  the  foregoing  section  we  have  seen  that  the  solution  of  the  filtering  problem 
AX  +  Z  =  Y  for  a  noise  model,  which  is  only  a  finitely  additive  cylinder  measure  on  the 
observation  space  H.„  gives  an  estimate  LnY.  which  is  optimal  for  linear  functionals  of 
the  signal,  i.e.  E(  <  X-L0Y,  h  >2)  =  min  {E(  <  X-LY,h  >  -’)  :  L  :  H,  — ♦  Hj  Hilbert-Schmidt 
operator}.  The  observation  measure  P°Y  1  in  this  case  also  is  a  finitely  additive  cylinder 
measure  on  H.v  In  this  framework,  for  instance  in  [1]  the  Kalman-Bucy  filter  equations 
are  developped  as  equations  on  the  observation  space,  related  to  the  finitely  additive 
cylinder  measure  P°Y  1  .  In  order  to  get  an  interpretation  of  the  estimate  as  conditional 
distribution  of  X  given  Y,  in  the  usual  countably  additive  sense,  the  concept  of  Abstract 
Wiener  space  is  commonly  used.  This  means,  that  the  observation  space  H,  is  embedded 
into  a  larger  space  W  in  such  a  way  that  a  radonification  of  the  cylinder  measure  to  a 
countably  additive  probability  measure  on  the  Borel  sets  of  W  is  possible  (cf.  [4],  [9]): 

Definition,  (cf.  [4]) 

Let  H  be  a  real,  separable  Hilbert  space,  and  W  a  real  Banach  space,  and  p  a  zero-mean 
Gaussian  cylinder  measure  on  H. 

Let  i:  H  — -W  be  a  continuous  injection  with  dense  range  in  W.  The  triple 

(i,  H,  W)  is  called  an  Abstract  Wiener  space  .  if  the  norm  II. II,  of  W  is  measurable  with 

respect  to  H  and  p  in  the  following  sense: 

For  every  e>0  there  is  a  finite-dimensional  projection  Pc  on  H  such  that  for  every 
finite-dimensional  projection  P  on  H  orthogonal  to  Pc  we  have: 
p({x  €  H:  IlifPxlllj  >t})<£ 

Examples  and  further  references  can  be  found  in  [9], 

To  apply  this  concept  to  our  filtering  problem  AX  +  Z  =  Y  we  make  again  the 
assumptions  that  X  is  a  zero-mean,  Gaussian,  Hj-valued  random  element  with  nuclear 
covariance  R,  Z  is  zero-mean  Gaussian  white  noise  on  H0  with  covariance  I,  and 
independent  of  X.  Then  the  weak  random  variable  Z  defines  a  zero-mean  Gaussian 
cylinder  measure  v,  such  that  for  the  corresponding  outer  measure  v*  holds  the  well- 
known  fact  v*(H2)  =  0  (cf.  [5]).  The  cylinder  measure  v  does  not  have  a  countably 
additive  extension  to  the  Borel  sets  £(H2)  of  H2,  since  in  a  certain  sense  H2  is  too  small. 
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Thus,  one  extends  H,: 


Definition.  Let  C:H2  — ►  H.,  be  an  arbitrary  linear  operator  which  is 


i)  self-adjoint,  positive  semi-definite 


ii)  injective 


iii)  Hilbert-Schmidt 


Define  <x,  y  >,:=  <Cx,  Cy>  (x,  y  €  H.,),  II. II,  the  corresponding  norm,  and 
let  W  be  the  completion  of  H  with  respect  to  the  norm  II. II 

It  is  shown  in  [4],  that  II. II,  is  a  measurable  norm  with  respect  to  v,  the  weak  distribution 
corresponding  to  Z.  Thus  the  triple  (i,  H2,  W),  i:H2  — .  W  the  canonical  injection,  is  an 
Abstract  Wiener  space.  The  Gaussian  cylinder  measure  v  induces  a  Gaussian  cylinder 
measure  vr  on  the  cylinder  sets  of  W  by 

vf  ({w€W:(w,'(w) . wn'(w))€B}):  =  v({h5H.,:(w,’(ih),  ...,wn'(ih))eB})  for  every  choice  of 

w,\  ...  ,wn'(:W,  n€N,  B€£(En). 

According  to  [4] ,  vc  possesses  a  countably  additive  extension  vc'  to  the 
o-algebra  generated  by  the  cylinder  sets  of  W.  Since  H„  is  separable,  this 
o-algebra  coincides  with  the  Borel-o-algebra  on  W  (cf.  [10]). 

Again  ,  vc'  is  zero-mean  Gaussian. 

We  denote  the  correlation  operator  of  this  measure  vc’  by  G,.  For  the  restriction  G,IH  , 
of  G,  onto  H2  (identified  with  the  subspace  i  (H,)  in  W)  we  have: 

Lemma  1.  G,IH2  =  C2 

Proof.  For  h,,  h2  cH2,  the  function  <.  ,h,>,<.  ,h2>,  :  W — .R  is  a  cylindrical 
function.  Hence,  by  definition  of  vr\ 

<hl’Glh2>l  =  !w<w’h1>.<w.h2>1dv'c(w)=  [  <h,  h,  > ,  <  h.  h,,>  ,dv  (h)  = 

L  <h.C2h1xh,C2h2>dv(h)=  <C2h,,C2h,>  =  <h,,C2h2>1 


# 
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Now,  we  consider  the  equation  AX  +  Z  =  Y.  We  transfer  canonically  the  probability 
measure  Po(AX) 1  onto  £(W): 

p:  =  Po(AX) 1 

p’^poi1  ,  i.e.  p’(B)  =  p  ({h£H,:i(h)€B})  (B€£(\Yn 
p'  is  zero-mean  Gaussian  and  for  its  correlation  operator  G  ,:\V — -W  holds: 


Lemma2.  G2IH2  =  ARA*C2 


Proof.  For  h.,  h2  €H.,, 


chj,  G2h.,>i  =  |  <w.  h1>l<w,h,>J  dp'iwi 


<h,C*h1>  <h,C‘h2>  dp  (hi  =  cC'-’ty  ARA*C’h,>  =  <h  ARA*C2h.,> 


The  sum  of  the  Gaussian  random  variables  on  W,  corresponding  to  vc'  and  p',  has  a 
measure  p'  as  its  distribution,  p'  is  also  zero-mean  Gaussian  and  has  G3:  =  G,  +  G.,  as 
correlation  operator,  due  to  the  assumed  independence  of  v  and  p  and  therefore  of  vc' 
and  p'.  We  now  have  immediately: 

Lemma  3.  i)  G3  is  nuclear 

ii)  G3IH2  =  (ARA*+I)C2 

iii)  p'  is  the  radonification  of  the  cylinder  measure  p°i  where  p 
denotes  the  weak  distribution  of  Y  in  H2. 

For  the  cross-correlation  G4:W — »H,  between  the  signal  and  the  observation  measure 
holds 


f  * 
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Lemma  4.  GJH2  =  RA*C2 


Proof.  For  h,  £H,,  h.,  €H„, 

r 

<  h  G  h,,  >-1  <  h.  h  >  <  w.  h ,  >  P<>X  1  idh  p'  i  - 

-HsW  1-1 


|  <h,  h  ><h'  h  >  PoX  'idh'p'dh'i  - 

•h,xh. 


I  -h.h  >-h',C  h,>  PoX  '  1  tdhi  (j 'dn'i  -  <h  RA*C'h,> 

JHjOl.,  ‘ 


+:-\ 

«*.v, 

/  V 


If  we  summarize  now  these  observations,  we  can  state  that  the  assumptions  of  2.2.1  are 
satisfied  for  the  transformed  model  in  the  larger  space  W.  Analogously  to  Proposition  1. 
we  obtain  an  extended  estimation  operator  LAV — -H  which  is  a  measurable 
transformation  ,  i.e.  p'  (D(L))  =  1.  Analogously,  Lw  gives  the  mean  of  the  conditional 
distribution  of  X  given  w€D(Lh  L  is  the  extension  of  L  =  G4G , 1  ,  where  p'  (D(Ln))  = 
p'lrgG ,)  =  0  again.  L  can  be  given  explicitly  on  the  dense  subspace  rg((  ARA*  +  LC  ')  of 
W: 

Proposition  2  L  is  given  on  rg((ARA* +I)CJ)  by 

L  I  rg((ARA*  +  I)CJ)  =  RA*(ARA*  +  I)  ‘ 

Proof.  LI  rg((ARA*  -t-DC-’i  =  G4 1  H,  °G  .  1 1  rg((ARA*  +  1)0’) 

=  RA*C-((ARA*  +  DC-) 1  =  RA*(ARA*  +  I) 1 
by  Lemma  3  and  Lemma  4. 

Thus,  a  solution  of  the  above  linear  filtering  problem  is  given  in  a  countably  additive 
framework  by  using  the  concept  of  radonification  of  the  cylinder  measures  in  Abstract 
Wiener  space. 


Remark  that  the  solution  is  independent  of  the  renorming  operator  C  by  which  the 
Abstract  Wiener  space  was  constructed.  It  is  formally  equal  to  the  'cylinder  measure 
solution'  and  differs  from  it  only  by  its  domain  of  definition,  which  makes  it  a 
measurable  transformati  n  from  W  to  H  with  respect  to  p’.  Hence,  the  martingale 
approach  to  the  Kalman-Bui filter  in  the  countably  additive  theory,  as  worked  out  ;n 
[7],  gives  formalh  the  same  result  as  the  'cylinder  measure  filter'  in  !lj.  We  conclude 
with  an  argument  that  shows  the  significance  of  the  finitely  additive  filtering  theory, 
recently  worked  out  by  G.  Kallianpur  and  R.  L.  Karandikar  [7],  namely  that  the  space 
H  ,  of  the  actual  observations  has  zero  prohabiltiy  with  respect  to  the  radonified 
observation  measure  p'  in  W. 

Proposition  2_  With  the  above  notations,  identifying  H  ,  with  i(  H  ,).  it  holds: 

p’lH  ,i  =  i) 

Proof:  As  is  well-known,  W  is  isometrically  isomorphic  to  a  closed  linear 

subspace  F  of  the  space  C[0,1]  of  continuous  functions  on  the  unit 
interval.  If  this  isomorphism  is  denoted  by  \p:W — >F,  and  H  denotes  the 
reproducing  Kernel  Hilbert  space  generated  by  the  covariance  of  the 
Gaussian  measure  vr’o\p  then  H,  =  vp  ‘(H),  according  to  [6],  If  B  is  the 
covariance  of  vr'o\p  then  H  if!  x(rg  (B:),  due  to  [3].  Thus  v(  '(H  )  = 
vr'oq,  *  (rg(B-))  =  0. 

The  last  equality  holds  analogously  to  ( 5 ) . 

Eventually,  using  a  result  of  G.  Kallianpur  and  R.L.  Karandikar  on 
mixtures  of  translates  of  the  canonical  Gauss  measure  (cf.  [8]),  we  have 

t 

p‘!H.>)=  )  vy'lH.,— h)p<dh>  =  0 
lH2 

# 

If  Z  is  a  white  noise  in  the  model  AX  +  Z  =  Y  on  an  observation  Hilbert  space  H  .  (often  of 
smooth  functions;  for  example  H2  the  RKHS  of  the  Brownian  motion),  this  precisely 
means,  that  the  filter  solution  with  respect  to  the  radonified  observation  measure  p’  on 
Abstract  Wiener  space  W  does  not  have  a  practical  meaning  as  long  as  observations  are 
considered  to  be  elements  of  the  space  H ,  (cf.  [8]),  i.e.  as  long  as  they  appear  with  zero 
probability. 


I  would  like  to  thank  Professor  G.  Kallianpur  for  his  kind  invitation  to  the  Center  for 
Stochastic  Processes.  I  am  greatly  indebted  to  him  and  also  to  Professors  S.  Cambanis 
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